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HT001 

wtORF (SEQ ID NO:1) 

MQRPNAHRISQPIRQIIYGLLLNASPHLDKTSWNALPPQPLAFSEVERINKNIRTSllDAVELAKDHSDLSRLTELSLRRRQMLLLETLKV 

KQTILEPIPTSLKLPfAVSCYWLQHTETKAKLHHLQSLLLTMLVGPLIAIINSPGKEELQEDGAKMLYAEFQRVKAQTRLGTRLDLDTAHl 

FCQWQSCLQMGMYLNQLLSTPLPEPDLTRLYSGSLVHGLCQQLLASTSVESLLSICPEAKQLYEYLFNATRSYAPAEIFLPKGRSNSK 

KKRQKKQNTSCSKNRGRTTAHTKCWYEGNNRFGLLMVENLEEHSEASNIE 

(-1)ORF (SEQ ID NO: 2) 

MQRPNAHRISQPIRQIIYGLLLNASPHLDKTSWNALPPQPLAFSEVERINKNIRTSIIDAVELAKDHSDLSRLTELSLRRRQMLLLETLKV 

KQTILEPIPTSLKLPIAVSCYWLQHTETKAKLHHLQSLLLTMLVGPLIAIINSPGKEELQEDGAKMLYAEFQRVKAQTRLGTRLDLDTAHI 

FCQWQSCLQMGMYLNQLLSTPLPEPDLTRLYSGSLVHGLCQQLLASTSVESLLSICPEAKQLYEYLFNATRSYAPAEIFLPKGRSNSK 

K KGRRNRIPAVLRTEGEPLHTPSVGMRETTGLGC 

(+1)/(-2) ORF (SEQ ID NO: 3/118) 

MQRPNAHRISQPIRQIIYGLLLNASPHLDKTSWNALPPQP 

KQTILEPIPTSLKLPIAVSCYWLQHTETKAKLHHLQSLLLTMLVGPLIAIINSPGKEELQEDGAKMLYAEFQRVKAQTRLGTRLDLDTAHI 
FCQWQSCLQMGMYLNQLLSTPLPEPDLTRLYSGSLVHGLCQQLLASTSVESLLSICPEAKQLYEYLFNATRSYAPAEIFLPKGRSNSK 
K K(K)AEETEYQLF 

U79260 

wt ORF (SEQ ID NO: 4) 

MGHPRAIQPSVFFSPYDVHFLLYPIRCPYLKIGRFHIKLKGLHFLFSFLFFFFETQSHSVTRLECSGTISAHCNLCLPGSSNSPASASRV 

AGTAGTCRRAQLIFVFLAEMGFHHVGRDGLDLNLVIHPPRSPKALGLQA 

(-1 )ORF (SEQ ID NO: 5) 

MGHPRAlQPSVFFSPYDVHFLLYPIRCPYLKlGRFHIKLKGLHFLFSFLFF FZ_/?HSLT 

LGLQARAAAPS 

(+1)/(-2)ORF (SEQ ID NO: 6) 

MGHPRAIQPSVFFSPYDVHFLLYPIRCPYLKIGRFHIKLKGLHFLFSFLFFF(F) 

PTHL3 

(wt)ORF (SEQ ID NO: 7) 

MQRRLVQQWSVAVFLLSYAVPSCGRSVEGLSRRLKRAVSEHQLLHDKGKSIQDLRRRFFLHHUAEIHTAEIRATSEVSPNSKPSPNT 
KNHPVRFGSDDEGRYLTQETNKVETYKEQPLKTPGKKKKGKPGKRKEQEKKKRRTRSAWLDSGVTGSGLEGDHLSDTSTTSLELD 
SRTALLWGLKKKKENNRRTHHMQLMISLFKSPLLLL 
(-1)ORF (SEQ ID NO: 8) 

MQRRLVQQWSVAVFLLSYAVPSCGRSVEGLSRRLKRAVSEHQLLHDKGKSIQDLRRRFFLHHUAEIHTAEIRATSEVSPNSKPSPNT 
KNHPVRFGSDDEGRYLTQETNKVETYKEQPLKTPGKKKKGKPGKRKEQEKKKRRTRSAWLDSGVTGSGLEGDHLSDTSTTSLELD 
SRTALLWGLKK KRK7TEEH//C/V 
(+1)/(~2)ORF (SEQ ID NO: 9) 

MQRRLVQQWSVAVFLLSYAVPSCGRSVEGLSRRLKRAVSEHQLLHDKGKSIQDLRRRFFLHHLiAEIHTAEIRATSEVSPNSKPSPNT 
KNHPVRFGSDDEGRYLTQETNKVETYKEQPLKTPGKKKKGKPGKRKEQEKKKRRTRSAWLDSGVTGSGLEGDHLSDTSTTSLELD 
SRTALLWGLKKK(K )GKQQKNTSYATNDLII 

TGFbRII 

(wt)(SEQ ID NO: 10) 

MGRGLLRGLWPLHIVLVVTRIASTIPPHVQKSVNNDMIVTDNNGAVKFPQLCKFCDVRFSTCDNQKSCMSNCSITSICEKPQEVCVAV 
WRKNDENITLEWCHDPKLPYHDFILEDAASPKCIMKEKKKPGETFFMCSCSSDECNDNlfFSEEYNTSNPDLLLVIFQWGISLLPPLG 
VAISViilFYCYRVNRQQKLSSTWETGKTRKLMEFSEHCAIILEDDRSDiSSTCANNINHNTELLPIELDTLVGKGRFAEWKAKLKQNTS 
EQFEWAVKIFPYEEYASWKTEKDIFSDINLKHENILQFLTAEERKTELGKQYWUTAFHAKGNLQEYLTRHVISWEDLRKLG^ 
AHLHSDHTPCGRPKMPIVHRDLNSSNILVKNDLTCCLCDFGLSLRLDPTLSVDDLANSGQVGTARYMAPEVLESRMNLENAESFKQT 
DVYSMALVLWEMTSRCNAVGEVKDYEPPFGSKVREHPCVESMKDNVLRDRGRPEIPSFWLNHQGIQMVCETLTECWDHDPEARLT 
AQCVAERFSELEHLDRLSGRSCSEEKIPEDGSLNTTK 
(-1)ORF (SEQ ID NO: 11) 

MGRGLLRGLWPLHIVLVVTRIASTIPPHVQKSVNNDMIVTDNNGAVKFPQLCKFCDVRFSTCDNQKSCMSNCSITSICEKPQEVCVAV 
WRKNDENITLEWCHDPKLPYHDFILEDAASFKCIMKEK K^^ 
(+1)/(-2)ORF (SEQ ID NO: 12/119) 

MGRGLLRGLWPLHIVLVVTRIASTIPPHVQKSVNNDMIVTDNNGAVKFPQLCKFCDVRFSTCDNQKSCMSNCSITSICEKPQEVCVAV 
WRKNDEN!TLETVCHDPKLPYHDFILEDAASPKClMKEKK(K)yW 

MACS 

(wt)ORF (SEQ ID NO: 13) 

MGAQFSKTAAKGEAAAERPGEAAVASSPSKANGQENGHVKVNGDASPAAAESGAKE 

AEKGEPAAAAAPEAGASPVEKEAPAEGEAAEPGSATAAEGEAASAASSTSSPKAEDGATPSPSNETPKKKKKRFSFKKSFKLSGFS 
FKKNKKEAGEGGEAEAPAAEGGKDEAAGGAAAAAAEAGAASGEQAAAPGEEAAAGEEGAAGGDPQEAKPQEAAVAPEKPPASDE 
TKAAEEPSKVEEKKAEEAGASAAACEAPSAAGPGAPPEQEAAPAEEPAAAAASSACAAPSQEAQPECSPEAPPAEAAE 
(-1)ORF (SEQ ID NO: 14) 

MGAQFSKTAAKGEAAAERPGEAAVASSPSKANGQENGHVKVNGDASPAAAESGAKEELQANGSAPAADKEEPAAAGSGAASPSS 
AEKGEPAAAAAPEAGASPVEKEAPAEGEAAEPGSATAAEGEAASAASSTSSPKAEDGATPSPSNETPKK KRSvAFPSRSLSS 
(+1)/(-2)ORF (SEQ ID NO: 15) 
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MGAQFSKTAAKGEAAAERPGEAAVASSPSKANGQENGH^ 

AEKGEPAAAAAPEAGASPVEKEAPAEGEAAEPGSATAAEGEAASA^ 

LLQEEQEGGWRRR 

TCF-4 

(wt)ORF (SEQ ID NO: 16) 

MPQLNGGGGDDLGANDELISFKDEGEQEEKSSENSSAERDLADVKSSLVNESETNQNSSSDSEAERRPPPRSESFRDKSRESLEEA 
AKRGDGGLFKGPPYPGYPFIMIPDLTSPYLPNGSLSPTARTYLQMKWPLLDVQAGSW 

VHPLTPLITYSNEHFTPGNPPPHLPADVDPKTGIPRPPHPPDISPYYPLSPGTVGQIPHPLGWLVPQQGQPVYPITTGGFRHPYPTALT 

VNASVSRFPPHMVPPHHTLHTTGIPHPAIVTPWKQESSQSDVGSLHSSKHQDSKKEEEKKKPHIKKPLNAFMLYMKE 

TLKESAAINQILGRRWHALSREEQAKYYELARKERQLHMQLYPGWSARDNYGKKKKRKRDKQPGETNEHSECFLNPCLSLPPITDLS 

APKKCRARFGLDQQNNWCGPCRRKKKCVRYIQGEGSCLSPPSSDGSLLDSPPPSPNLLGSPPRDAKSQTEQTQPLSLSLKPDPLAH 

LSMMPPPPALLU\EATHKASALCPNGALDLPPAALQPAAPSSSIAQPSTSWLHSHSS!^GTQPQPLSLVTKSLE 

(«1)ORF (SEQ ID NO: 17) 

MPQLNGGGGDDLGANDELISFKDEGEQEEKSSENSSAERDLADVKSSLVNESETNQNSSSDSEAERRPPPRSESFRDKSRESLEEA 
AKRQDGGLFKGPPYPGYPFIMIPDLTSPYLPNGSLSPTARTYLQMKWPLL^ 

VHPLTPUTYSNEHFTPGNPPPHLPADVDPKTGiPRPPHPPDISPYYPLSPGTVGQIPHPLGWLVPQQGQPVYPITTGGFRHPYPTALT 
VNASVSRFPPHMVPPHHTLHTTGIPHPAIVTPW^ 

TLKESAAINQILGRRWHALSREEQAKYYELARKERQLHMQLYPGWSARDNYGKKKKRKRDKQPGETNEHSECFLNPCLSLPPITDLS 

APKKCRARFGLDQQMUWCGPCRRK KSAFATYKVKAAASAHPLQMEAY 

(+1)/(-2)ORF (SEQ ID NO: 18) 

MPQLNGGGGDDLGANDELISFKDEGEQEEKSSENSSAERDLADVKSSLVNESETNQNSSSDSEAERRPPPRSESFRDKSRESLEEA 
AKRQDGGLFKGPPYPGYPFIMIPDLTSPYLPNGSLSPTARTYLQMKWPLLDVQAGSLQSRQALKDARSPSPAHIVSNKVPWQHPHH 
VHPLTPLITYSNEHFTPGNPPPHLPADVDPKTGIPRPPHPPDISPYYPLSPGTVGQIPHPLGWLVPQQGQPVYPITTGGFRHPYPTALT 
VNASVSRFPPHMVPPHHTLHTTGIPHPAIWPWKQESSQSDVGSLHSSKHQDSKKEEEKKKPHIKKPLNAFMLYMKEMRAKWAEC 
TLKESAAINQILGRRWHALSREEQAKWELARKERQLHMQLYPGWSARDNYGKKKKRKRDKQPGETNEHSECFLNPCLSLPPITDLS 

APKKCF^RFGLDQQNNWCGPCRRKK(K)WSLH7E 

TAF1b 

(wt)ORF (SEQ ID NO: 19) 

IPAFPAGTVLQPFPEAALATRVTVPAVEAPAAPRLDLEESEEFKERCTQCAAVS 

QIKALNRGLKKKNNTEKGWDWWCEGFQYILYQCV\EALKNLGVGPELKNDVLHNFWKRYLQKSKQAYCKNP\ATTGRKPW 

SHSDWASEPELLSDVSCPPFLESGAESQSDIHTRKPFPVSKASQSETSVCSGSLDGVEYSQRKEKGIVKMTMPQTLAFCYLSLLWQ 

REAITLSDLLRFVEEDHIPYINAFQHFPEQMKLYGRDRGIFGIESWPDYEDIYKKTIEVGTFLDLPRFPDITEDCYLHPNILCMKYLMEVN 

LPDEMHSLTCHWKMTGMGEVDFLTFDPIAKMAKAVKYDVQAVAIINAA/LKLLFLMDDSFEWSLSNLAEKH 

YQIMKKAFDEKKQKWEEARAKYLWKSEKPLYYSFVDKPVAYKKREMWNLQKQFSTLVDSTATAGKKSPSSFQFNVVTEEDTDR 

FHGHSLQGVLKEKGQSLLTKNSLYWLSTQKFCRW 

(-1)ORF (SEQ ID NO: 20) 

IPAFPAGTVLQPFPEAALATRVWPAVEAPAAPRLDLEESEEFKERCTQCAAVSW 

QIKALNRGLKK KTILKKAGIGMCVKVSSIFFINKQKP 

(+1)/(-2)ORF (SEQ ID NO: 21/120) 

IPAFPAGWLQPFPEAAl^TRVWPAVEAPAAPRLDLEESEEFKERCT 
QIKALNRGLKKKiSQY 

AC-1 

(wt)ORF (SEQ ID NO: 22) 

MDTQKQIHKTHNSKNQFFTIFFFLSVEFGKEGTRKNFYLLLSIGHYGRKSRRADLGTADTADKTEPECFAASmFDPNPSNm/SGAHS 
TAVHQ 

(-1)ORF (SEQ ID NO: 23) 

MPTQKQIHKTHNSKNQFFTIF Fjj>CQy^ 
(+1 )/(-2)ORF (SEQ ID NO: 24) 
MDTQKQIHKTHNSKNQFFTIFF(F)P\/S 

Sec63 

(wt)ORF (SEQ ID NO: 25) 

MAGQQFQYDDSGNTFFYFLTSFVGUVIPATYYLWPRDQNAEQIRLKNIRK^ 

AYKVSKTDREYQEYNPYEVLNLDPGAWAEIKKQYRLLSLKYHPDKGGDEVMFMRIAKAYAALTDEESRKNWEEFGNPDGPQATSF 

GIALPAWIVDQKNSILVLLVYGLAFMVILPNAA/GSVVVVYRSIRYSGDQILIRTTQIYTYFWKTRNM 

SRPTDNILIPQUREIGSINLKKNEPPLTCPYSLKARVLL^^ 

APTLASLENCMKLSQMAVQGLQQFKSPLLQLPHIEEDNLRRVSNHKKYKIKTIQDLVSLKESDRHTLLHFLEDEKYEEVMAVLGSFPY 

WMDIKSQVLDDEDSNNIWGSLVTVLVKLTRQTMAEVFEKEQSICAAEEQPAEDGQGETNKNRTKGGWQQKSKGPKKTAKSKKKK 

PLKKKPTPVLLPQSKQQKQKQANGWGNEAAVKEDEEEVSDKGSDSEEEETNRDSQSEKDDGSDRDSDREQDEKQNKDDEAEW 

QELQQSIQRKERALLETKSKITHPWSLYFPEEKQEWWWLYIADRKEQTLISMPYHVCTLKDTEEVELKFPAPGKPGNYQYTVFLRSD 

SYMGLDQIKPLKLEVHEAKPVPENHPQWDTAIEGDEDQEDSEGFEDSFEEEEEEEEDDD 

(-1) 9er A-Repeat (SEQ ID NO: 26) 

MAGQQFQYDDSGNTFFYFLTSFVGLIVIPATYYLWPRDQNAEQIRLKNiRKWGRCMWYRLRLLKPQPNIIPTVKKiVLLAGWALFLFL 
AYKVSKTDREYQEYNPYEVLNLDPGATVAEIKKQYRLLSLKYHPDKGGDEVMFMRIAKAYAALTDEESRKNWEEFGNPDGPQATSF 
GIALPAWiVDQKNSILVLLWGLAFMVILPVWGSWV^ 

SRPTDNILIPQLIREIGSINLKKNEPPLTCPYSLKARVLLLSHUVRMKIPETLEEDQQFMLKKCPALLQEMVNViCQLIVMARN 
APTI^SLENCMKLSQMAVQGLQQFKSPLLQLPHIEEDNLRRVSNHKKYKIKTIQDLVSLKESDRHTLLHFLEDEKYEEVMAVLGSFPY 
WMDIKSQVLDDEDSNNIWGSLNm/LVKLTRQTMAEVFEKEQSlCAAEEQPAEDGQGETNKNRTKGGWQQKSKGPKKTAKSKKM 
L 
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(+1)/(~2) 9er A-Repeat (SEQ ID NO: 27) 

MAGQQFQYDDSGNTFFYFLTSFVGLIVIPATYYLWPRDQNAEQIRLKNIRKWGRCMWYRLRLLKPQPNIIPTVKKIVLLAGWALFLFL 
AYKVSKTDREYQEYNPYEVLNLDPGAWAEIKKQYRLLSLKYHPDKGGDEVMFMRIAKAYAALTDEESRKNWEEFGNPDGPQATSF 
GlALPAWIVDQKNSILVLLWGLAFMVILPNAA/GSWWYRSlRYSGDQILiRTTQlYTYFWKTRNMDM 
SRPTDNIUPQLIREIGSINLKKNEPPLTCPYSLKAR^ 

APTLASLENCMKLSQMAVQGLQQFKSPLLQLPHIEEDNLRRVSNHKKYKIKJIQDLVSLKESDRHTLLHFLEDEKYEEVMAVLGSFPY 
WMDIKSQVLDDEDSNNITVGSLVTVLVKLTRQTMAEVFEKEQSICAAEEQPAEDGQGETNKNRTKGGWQQ 
KSKGPKKTAKSKK(K )ETFKKKTYTCAITTVKA TETKA GKWSRWE 
(-1) 10er A-Repeat (SEQ ID NO: 28) 

MAGQQFQYDDSGNTFFYFLTSFVGUVIPATYYLWPRDQNAEQIRLK^ 

AYKVSKTDREYQEYNPYEVLNLDPGATVAEIKKQYRLLSLKYHPDKGGDEVMFMRIAKAYAALTDEESRKNWEEFGNPDGPQATSF 
GIALPAWIVDQKNSILVLLWGLAFMVILPVWGSWWYRSIRYSGDQIU^ 

SRPTDNIUPQLIREIGSINLKKNEPPLTCPYSLKARVLLLSHLARMKIPETLEEDQQFMLKKCPALLQEMVNVICQLIVMARNREEREFR 

APTU\SLENCMKLSQMAVQGLQQFKSPLLQLPHIEEDNLRRVSNHKKYKIKTIQDLVSLKESDRHTLLHFLEDEKYEEVMAVLGSFPY 

WMDIKSQVLDDEDSNNITVGSLVTVLVKLTRQTMAEVFEKEQS!CAAEEQPAEDGQGETNKNRTKGGWQQ 

KSKGPKKTAKSKKKKPLK KNLHLCYYHSQSNRNKSRQMESLGMKLQ 

(+1)/(-2) 10er A-Repeat (SEQ ID NO: 29) 

MAGQQFQYDDSGNTFFYFLTSFVGLIVIPATYYLWPRDQNAEQIRLKNIRKW 

AYKVSKTDREYQEYNPYEVLNLDPGATVAEiKKQYRLLSLKYHPDKGGDEVMFMRIAKAYAALTDEESRKNWEEFGNPDGPQATSF 

GIALPAWIVDQKNSILVLLWGLAFMVILPVWGSWWYRSIRYSGDQILIRTTQIYTYFWKTRNMDM 

SRPTDNILIPQLIREIGSINLKKNEPPLTCPYSLKARVLLLSHLARMKIPETLEEDQQFMLKKCPALLQEMVNVICQLIVM 

APTLASLENCMKLSQMAVQGLQQFKSPLLQLPHIEEDNLRRVSNHKKYKIKTIQDLVSLKESDRHTLLHFLEDEKYEEVMAVLGSFPY 

VTMDIKSQVLDDEDSNNITVGSLVTVLVKLTRQTMAEVFEKEQSICAAEEQPAEDGQGETNKNRTKGGWQQ 

K^KGPKKTAKBKKKKPLKK(K )TYTCAtTTVKATETKAGKWSRWE 

Caspase 5 

(wt)ORF (SEQ ID NO: 30) 

MFKGILQSGLDNF^INHMLKNNVAGQTSIQTLVPNTDQKSTSVKKDNHKKKWKMLEYLGKDVLHGVFNYLAKHDVLTLKEEEKKKYY 
DAKIEDKAULVDSLRKNRVAHQMFTQTLLNM 

LIICNTKFDHLPARNGAHYDIVGMKRLLQGLGYTWDEKNLTARDMESVLRAFAARPEHKSSDSTFLVLMSHGILEGICGTAHKKKKPD 
VLLYDTIFQIFNNRNCLSLKDKPKVIIVQACRGEKHGELWVRDSPASLAVISSQSSENLEADSVCKIHEEKDFIAFCSSTPHNVSWRDR 
TRGSIFITEUTCFQKYSCCCHLMEIFRKVQKSFEVPQAKAQMPTIERATLTRDFYLFPGN 
(-1)ORF (SEQ ID NO: 31) 

MFKGH QSGl DNFVINHMI KNNVAGQTSIQTLVflWDQKSTSVKKPNHKK K^^ 
(+1)/(-2)ORF (SEQ ID NO: 32) 

MFKGILQSGLDNFVINHMLKNNVAGQTSIQTLVPNTDQKSTSVKKDNHKK(K)/VS 

AIM2 

(wt)ORF (SEQ ID NO: 33) 

MESKYKEILLLTGLDNITDEELDRFKFFLSDEFNIATGKLHTANRIQVATLMIQNAGAVSAVMKTIRIFQKLNYMLLAKRLQEEKEKVDKQ 
YKSWKPKPLSQAEMSPAASAAIRNDVAKQRAAPKVSPHVKPEQKQMVAQQESIREGFQKRCLPVMVLKAKKPFTFETQEGKQEMF 
HATVATEKE FFFVKVFNTLLKDKFI P KRI 1 1 I ARYYRHSGFLE VNS ASRVL.DAESDQKVN VPLNI I RKAGETP Kl NTLQTQPLGTI VNGLFV 
VQKWEKKKNILFDLSDNTGKMEVLGVRNEDTMKCKEGDKVRLTFFTLSKNGEKLQLTSGVHSTIKVIKAKKKT 
(-1)ORF (SEQ ID NO: 34) 

MESKYKEiLLLTGLDNITDEELDRFKFFLSDEFNIATGKLHTANRIQVATLMIQNAGAVSAVMKTiRIFQKLNYMLLAKRLQEEKEKVDKQ 
YKSWKPKPLSQAEMSPAASAAIRNDVAKQRAAPKVSPHVKPEQKQMVAQQESIREGFQKRCLPVMVLKAKKPFTFETQEGKQEMF 
HATVATEKEFFFVKVFNTLLKDKFIPKRIIIIARYYRHSGFLEVNSASRVLDAESDQKVNVPLNIIRKAGETPKINTLQTQPLGTIVNGLFV 
VOKWEKKKMiLFni^nh^T^^^P^^ GVRNEPTMKCKEGDKVRl TFFTl ^KNGEKI QLTSGVHSTlKVIKAKK KHf?£VKRT/VSSQL\/ 
(+1)/(-2)ORF (SEQ ID NO: 35) 

MESKYKEILLLTGLDNITDEELDRFKFFLSDEFNIATGKLH^^ 

YKSVTKPKPLSQAEMSPAASAAIRNDVAKQRAAPKVSPHVKPEQKQMVAQQESIREGFQKRCLPVMVLKAKKPFTFETQEGKQEM 
HATVATEKEFFFVKVFNTLLKDKFIPKR!l!IARYYRHSGFLEVNSASR\^DAESDQKVNVPLNIIRKAGETPKINTLQTQPLGTIVNGLFV 
VQKWEKKKNILFDLSDNTGKMEVLGVRNEDTMKCKEGDKVRLTFFTLSKNGEKLQLTSGVHSTIKVIKAKK(K)MH< 

SLC23A1 

(wt)ORF (SEQ ID NO: 36) 

MMGiGKNTTSKSMEAGSSTEGKYEDEAKHPAFFTLPWINGGATSSGEQDNEDTELMAIYTTENGIAEKSSLAETLDSTGSLDPQRS 
DMIYTIEDVPPWYLCIFLGLQHYLTCFSGTIAVPFLLAD 

!LSLDKWKCNTTDVSVANGTAELLHTEHI\A^PRIREIQGAIIMSSLIEWIGLLGLPGALLKYIGPLTITPWALlGLSGFQAAGE 

GIAMLTIFLVLLFSQYARNVKFPLPIYKSKKGmAYKLQLFKMFPIILAILVSWLLCFIFTVTDVFPPDSTKYGFYARTDARQGVLLVAPW 

FKVPYPFQWGLPWSAAGVIGMLSAWASIIESIGDYYACARLSCAPPPPIHAINRGIFVEGLSCVLDGIFGTGNGSTSSSPNIGVLGITK 

VGSRRVIQCGAALMLALGMIGKFSALFASLPDPVLGALFCTLFGMITAVGLSNLQFIDLNSSRNLFVLGFSIFFGLVLPSYLRQNPLVTGI 

TGIDQVLNVLLTTAMFVGGCVAFILDNTIPGTPEERGIRKWKKGVGKGNKSLDGMESYNLPFGMNHKKYRCFSYLPISPTFVGYTWK 

GLRKSDNSRSSDEDSQATG 

(-1)ORF (SEQ ID NO: 37) 

MMGIGKNTTSKSMEAGSSTEGKYEDEAKHPAFFTLPWING 
DMIYTIEDVPPWYLCIFLGLQHYLTCFSGTIAVPFLU\DAMCVGYDQW 

ILSLDKWKCNTTDVSVANGTAELLHTEHIWYPRIREIQGAIIMSSLIEWIGLLGLPGALLKYIGPLTITPWALIGLSGF 
GIAMLTIFLVLLFSQYARNVKFPLPIYKSKKGmAYKLQLFKMFPIILAILVSWLLCFlFTWDVFPPDSTKYGFYARTDARQGVLLVAPW 
FKVPYPFQWGLPWSAAGVIGMLSAWASIIESIGDYYACARLSCAPPPPSTQ 
(+1)/(~2)ORF (SEQ ID NO: 38) 
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MMGIGKNTTSKSMEAGSSTEGKYEDEAKHPAFFTLPW1NGGATSSGEQDNEDTELMAIYTTENG1AEKSSLAETLDSTGSLDPQRS 

DMIYTIEDVPPWYLCIFLGLQHYLTCFSGTIAVPFLL^ 

ILSLDKWKCNTTDVSVANGTAELLHTEHIWYPRIR 

GIAMLTlFLVLLFSQYARNVKFPLPIYKSKKGmAYKLQLFKMFPIIU\ILVSWLLCFIFTWDVFPPDSTKYGFYARTDARQGVLLVAPW 

FKVPYPFQWGLPTVSAAGVIGMLSAWASIIESIGDYYACARLSCAPPP 

(P )HPRNKQGNFRGRPLLCS 

ABCF1 

(wt)ORF (SEQ ID NO: 39) 

MPKAPKQQPPEPEWIGDGESTSPSDKWKKGKKDKKiKKTFFEELAVEDKQAGEEEKVLKEKEQQQQQQQQQQKKKRDTRKGRR 
KKDVDDDGEEKELMERLKKLSVPTSDEEDEVPAPKPRGGKKTKGGNVFAALIQDQSEEEEEEEKHPPKPAKPEKNRINKAVSEEQQ 
PALKGKKGKEEKSKGKAKPQNKFAALDNEEEDKEEEHKEKEPPKQGKE^^ 

QAMLENASDIKLEKFSISAHGKELFVNADLYiVAGRRYGLVGPNGKGKTTLLKHIANRALSIPPNIDVLLCEQEWADETPAVQAVLRAD 
TKRLKLLEEERRLQGQLEQGDDTAAERLEKWEELRATGAAAAEAK^^ 

TLLMLDEPTNHLDLNAVIWLNNYLQGWRKTLLIVSHDQGFLDDVCTDIIHLDAQRLHYYRGNYMTFKKMYQQKQKELLKQYEKQEKKL 

KELKAGGKSTKQAEKQTKEALTRKQQKCRRKNQDEESQEAPELLKRPKEYTVRFTFPDPPPLSPPVLGLHGVTFGYQGQKPLFKNL 

DFGIDMDSRICIVGPNGVGKSTLLLLLTGKLTPTHGEMRKNHRLKIGFFNQQYAEQLRMEETPTEYLQRGFNLPYQDARKCLGRFGLE 

SHAHTIQICKLSGGQKARWFAELACREPDVLILDEPTNNLDIESIDALGEAINEYKGAVIWSHDARLITETNCQLWWEEQSVSQIDG 

DFEDYKREVLEALGEVMVSRPRE 

(-1)ORF (SEQ ID NO: 40) 

MPKAPKQOPPFPFWIGDGESTSPSDKWKKGKKDKKIKKTFFEELAVEDKQAGEEEKVLKEKEQQQQQQQQQQK KSE/PE/CAGGR 
RMWMMMEKRKSS WS VLRSSQCQPVMRRMKYPPQNPAEGRKPR WMFLQP 
(+1)/(-2)ORF (SEQ ID NO: 41) 

MPKAPKQOPPFPPWIGDGESTSPSDKWKKGKKDKKfKKTFFEELAVEPKQAGEEEKVLKEKEQQQQQQQQQQKKrK ^RYPKRQyA 
EEGCG 

HSPC259 

(wt)ORF (SEQ ID NO: 42) 

SPDYFPQISSQFGTVEK - ???- 

MEKIFISSSTKAEGKGISPFEAPINTC^PPEKGKEAWQEPERSWFQTKEERKKEKIAKALQEFDL^LRGKKKRKKFMKDAKKKGEMT 

AEERSQFEILKAQMFAERLAKRNRRAKRARAMPEEEPVRGPAKKQKQGKKSVFDEELTNTSKKALKQYRAGPSFEERKQLGLPHQR 

RGGNFKSNPDTRGGSSCRGLKKFMGAALKSLPCGKSSWLVCLFSICLKKKQKQKTTLWCGGMVRSYFPKHVCQSPFLLISFHMTIL 

NGSIFGKRE 

(-1)ORF (SEQ ID NO: 43) 

MEKIFISSSTKAEGKGISPFEAP]NT<^PPEKGKEAW 

AEERSQFEILKAQMFAERLAKRNRRAKRARAMPEEEPVRGPAKKQKQGKKSVFDEELTNTSKKALKQYRAGPSFEERKQLGLPH 

RnGNFKSNPDTRGGSSCRGLKKFMGAALKSLPCGKSSWLVCLFSlCL^ 

(+1)/(~2)ORF (SEQ ID NO: 44) 

MEKIFISSSTKAEGKGISPFEAPINTQAPPEKGKEAWQEPERSWFQTKEERKKEKIAKALQEFDLALRGKKKRKKFMKDAKKKGEM 

AEERSQFEILKAQMFAERI^KRNRRAKRAF^MPEEEPVRGPAKKQKQGKKSVFDEELTNTSKKALKQYRAGPS 

RGGNFKSNPDTRGGSSCRGLKKFMGAALKSLPCGKSSWLVCLFSICLKK(K )TKrKA/NrL\/l/lWYGr 

Bax 

(wt)ORF (SEQ ID NO: 45) 

MDGSGEQPRGGGPTSSEQIMKTGALLLQGFIQDRAGRMGGEAPELALDPVPQDASTKKLSECLKRIGDELDSNMELQRMIAAVDTD 
SPREVFFRVAADMFSDGNFNWGRWALFYFASKLVLKALCTKVPELiRTIMGWTLDFLRERLLGWIQDQGGWDGLLSYFGTPTWQT 
VTIFVAGVLTASLTIWKKMG 
(-1)ORF (SEQ ID NO: 46) 

MDGSGEQPRGGGPTSSEQIMKTGALLLQGFIQDF^GRMG GRHPSl4/Pm-RC/-RMRPPRS 
(+1)/(~2)ORF (SEQ ID NO: 47) 

MPGSGEQPRGGGPTSSEQIMKTGALLLQGFlQDRAGRMG G^GjGTPAGPGPG/ASGCVHQSAERVSQyAHRGRTGQ 

TCF6L1 

(wt)ORF (SEQ ID NO: 48) 

MAFLRSMWGVLSALGRSGAELCTGCGSRLRSPFSFWLPRWFSSVLASCPKKPVSSYLRFSKEQLPIFKAQNPDAKTTELIRRIAQR 
WRELPDSKKKIYQDAYRAEWQWKEEISRFKEQLTPSQIMSLEKEIMDKHLKRKAMTKKKELTLLGKPKRPRSAYNVYVAERFQEAK 
GDSPQEKLKTVKENWKNLSDSEKELYIQHAKEDETRYHNEMKSWEEQMIEVGRKDLLRRTIKKQRKYGAEEC 
(-1)ORF (SEQ ID NO: 49) 

MAFLRSMWGVLSALGRSGAELCTGCGSRLRSPFSFWLPRWFSSVU\SCPKKPVSSYLRFSKEQLPIFKAQNPDAKTTELIRRIAQR 

WRELPDSKKKIYQDAYRAEWQVYKEEISRFKEQLTPSQIMSLEKEIMDKHLKRKAMTKKKS 

(+1)/(-2)ORF (SEQ ID NO: 50) 

MAFLRSMWGVLSALGRSGAELCTGCGSRLRSPFSFWLPRWFSSVLASCPKKPVSSYLRFSKEQLPIFKAQNPDAKTTELIRRIAQR 
WRFl PDSKKKIYQPAYRAEWQWKEEISRFKEQLTPSQIMSLEKEIMDKHLKRKAMTKK(K )jRV^^ 

FTL3L 

(wt)ORF (SEQ ID NO: 51) 

MWLAPAWSPTTYLLLLLLLSSGLSGTQDCSFQHSPISSDFAVKIRELS 

VAGSKMQGLLERVNTEIHFVTKCAFQPPPSCLRFVQTNISRLLQETSEQLVALKPWITRQNFSRCLELQCQPDSSTLPPPWSPRPLE 

ATAPTAPQPPLLLLLLLPVGLLLLAAAWCLHWQRTRRRTPRPGEQVPPVPSPQDLLLVEH 

(-1)ORF(SEQIDNO:52) 

MWLAPAWSPTTYLLLLLLLSSGLSGTQDCSFQHSPISSDFAVKIRELSDYLLQDY 

VAGSKMQGLLERVNTEIHFVTKCAFQ 

PPP A VFASSRPTSPASCRRPPSSWWR 
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(+1)/(-2)ORF (SEQ ID NO: 53) 

MWU\PAWSPTTYLLLLLLLSSGLSGTQDCSFQHSPISSDFAV 
VAGSKMQGLLE R VNTE I H F VTKC AFQ 

PP(P )QLSSLRPDQHLPPPAGDLRAAGGAEALDHSPELLPVPGAAVSAR 

OGT 

(wt)ORF (SEQ ID NO: 54) 

MLQGHFWLVREGiMISPSSPPPPNLFFFPLQIFPFPFTSFPSHLLSLTPP 
KAVTLDPNFLDAYINLGNVLKEARIFDRAVAAYLRALSL^ 

KGSVAEAEDCYNTALRLCPTHADSLNNLANIKREQGNIEEAVRLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTF 

ADAYSNMGNTLKEMQDVQGALQCYTRAiQiNPAFADAHSNLASIHKDSGNIPEAIASYRTALKLKPDFPDAYCNLAHCLQiVCDWTDY 

DERMKKLVSIVADQLEKNRLPSVHPHHSMLYPLSHGFRKAIAERHGNLCLDKINVLHKPPYEHPKDLKLSDGRLRVGYVSSDFGNHPT 

SHLMQSIPGMHNPDKFEVFCYALSPDDGTNFRVKVMAEANHFIDLSQIPCNGKAADRIHQDGIHILVNMNGYTKGARNELFALRPAPI 

QAMWLGYPGTSGALFMDYIITDQETSPAEVAEQYSEKLAYMPHTFFIGDHANMFPHLKKKAVIDFKSNGHIYDNRIVLNGIDLKAFLDS 

LPDVKlVKMKCPDGGDNADSSNTALNMPViPMNTiAEAViEMlNRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIi\m^ 

EDAIVYCNFNQLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPKEEHVRRGQLADVCLDTPL 

CNGHTTGMDVLWAGTPMWMPGETIJ\SRVAASQLTCLGCLELIA 

MELERLYLQMWEHYAAGNKPDHMIKPVEVTESA 

(-1)ORF(SEQIDNO:55) 

MLQGHFWLVREGiMlSPSSPPPPNLF FSL YKFSPFPLPPFPPIFFH 
(+1)/(«2)ORF (SEQ ID NO: 56) 

MLQGHFWL\/REGM\SPSSPPPPNLFF(F )PFTNFPLSLYLLSLPSSFINPS 

ELAVL3 

(wt)ORF (SEQ ID NO: 57) 

MESQVGGGPAGRPAQRPLLGTNGATDDSKTNLIVNYLPQNMTQDEFKSLFGSIGDIESCKLVRDKITGQSLGYGFVNYSDPNDADKA 
INTLNGLKLQTKTIKVSYARPSSASIRDANLYVSGLPKTMSQKEIVIEQLFSQYGRIITSRILVDQVTGVSRGVGFIRFDKRIEAEEAIKGLN 
GQKPLGAREPIWKFANNPSQKTGC^LLTHLYQSSARRYAGPLHHQTQRFRLDNLLNMAYAVKRFSPIAIDGMSGLAGVGLSGGAAG 
GWCIFWNLSPEPDQSVLWQLFGPFGAWNVKVIRDFTTNKCKGFGFMTMTNYDEAAMAIASLNGYRLGQRVLQVSFKTSKQHKA 
(-1)ORF (SEQ ID NO: 58) 

MESQVG GARPAGLPNGHSLVQMEPLTTARPTSSSTTCPRT 
(+1)/{-2)ORF (SEQ ID NO: 59) 
ME$QVGG(G )PGRPACPTATPWYKWSH 

MAC30X 

(wt)ORF (SEQ ID NO: 60) 

LFSHQRVQAQPTDYGGSFTRRCVEWLLGLYFLSHIPITLFMDLQAWPRELYPVEFRNLLKVVYAKEFKDPLLQEPPAWFKSFLFCELV 

FQLPFFPIATYAFLKGSCKWIRTPAIIYSVHTMTTLILILSTFLFEDFSKASGFKGQRPETLHERLTLVSWAPYLLIPFILLIFMLRS 

EEKRKKK 

(-1)ORF (SEQ ID NO: 61) 

LFSHQRVQAQPTDYGGSFTRRCVEWLLGLYFLSHIPITLFMDLQAWPRELYPVEFRNLLKVVYAKEFKDPLLQEPPAWFKSFLFCELV 
FQLPFFPIATYAFLKGSCKWiRTPAIIYSVHTMTTLILlLSTFLFEDFSKASGFKGQRPETLHERLTLVSWAPYLLIPFILLIFMLRSPYYKY 
EEKRK KNEGNNHWPRVEMPTGWLLVGYIQEHCSEPTSSAAFETLAAMHKSKMVSGTMSNPHLLPFFFFF 
(+1)/(-2)ORF (SEQ ID NO: 62) 

LFSHQRVQAQPTDYGGSFTRRCVEWLLGLYFLSHIPITLFMDLQAWPRELYPVEFRNLLKWYAKEFKDPLLQEPPAWFKSFLFCELV 

FQLPFFPIATYAFLKGSCKWIRTPAIIYSVHTMTTULILSTFLFEDFSKASGFKGQRPETLHERLTLVSWAPYLLIPFILL 

EEKRKK(K )MKETTTGPG 

SLC4A3 

(wt)ORF (SEQ ID NO: 63) 

MANGViPPPGGASPLPQVRVPLEEPPLSPDVEEEDDDLGKTLAVSRFGDUSKPPAWDPEKPSRSYSERDFEFHRHTSHHTHHPLSA 

RLPPPHKLRRLPPTSARHTRRKRKKEKTSAPPSEGTPPiQEEGGAGVDEEEEEEEEEEGESEAEPVEPPPSGTPQKAKFSIGSDEDD 

SPGLPGRAAVTKPLPSVGPHTDKSPQHSSSSPSPRARASRLAGEKSRPWSPSASYDLRERLCPGSALGNPGGPEQQVPTDEAEAQ 

MLGSADLDDMKSHRLEDNPGVRRHLVKKPSRTQGGRGSPSGLAPILRRKKKKKKLDRRPHEVFVELNELMLDRSQEPHWRETARW 

lKFEEDVEEETERWGKPHVASLSFRSLLELRRTIAHGAALLDLEQTTLPGIAHLWETMIVSDQIRPEDRASVLRTLLLKHSHPNDDKDS 

GFFPRNPSSSSMNSVLGNHHPTPSHGPDGAVPTMADDLGEPAPLWPHDPDAKEKPLHMPGGDGHRGKSLKLLEKIPEDAEATWL 

VGCVPFLEQPAAAFVRLNEAVLLESVLEVPVPVRFLFVMLGPSHTSTDYHELGRSIATLMSDKLFHEAAYQADDRQDLLSAISEFLDG 

SIVIPPSEVEGRDLLRSVAAFQRELLRKRREREQTKVEMTTRGGYTAPGKELSLELGGSEATPEDDPLLRTGSVFGGLVRDVRRRYP 

HYPSDLRDALHSQCVAAVLFIYFAALSPAiTFGGLLGEKTEGLMGVSELIVSTAVLGVLFSLLGAQPLLWGFSGPLLVFEEAFFKFCRA 

QDLEYLTGRVWVGLWLWFVU\LVAAEGSFLV^^ 

ALPPTEGPPSPRNQPNTALLSLILMLGTFFIAFFLRKFRNSRFLGGKARRilGDFGIPISlLVMVLVDYSlTDTYTQKLWPTGLSVTSPDK 

RSWFJPPLGSARPFPPWMMVAAAVPALLVLILIFMETQITALIVSQKARRLLKGSGFHLDLLLIGSLGGLCGLFGLPWLTAAW 

NALTVMRTAIAPGDKPQIQEVREQRWGVLIASLVGLSiVMGAVLRRI^ 

KVKTWRMHLFTCIQLGCIALLWWKSTAASLAFPFLLLLWPLRHCLLPRLFQDRELQALDSEDAEPNFDEDGQDEYNELH 
(-1)ORF (SEQ ID NO: 64) 

MANGVIPPPGGASPLPQVRVPLEEPPLSPDVEEEDDDLGKTLAVSRFGDUSKPPAWDPEKPSRSYSERDFEFHRHTSHHTHHPLSA 
Rl PPPHKl RRI PPTRARHTRRKRKKFKTSAPPSEGTPPiQEEGGAGVDEEEEEEEEEEGESEAEPVEPPP QGPHRRQSSPLEW-RM 
WQASLGGLLSPSPCPRWAHTLTRAPSTPAAPPAPGPGPPDSLGRKAGPGAHRPVMTCGSDCAQAVPWATQVVQSSRCPQMRRR 
PRCWVLQTWTT 
(+1)/(-2)ORF (SEQ ID NO: 65) 

MANGVIPPPGGASPLPQVRVPLEEPPLSPDVEEEDDDLGKTLAVSRFGDLISKPPAWDPEKPSRSYSERDFEFHRHTSHHTHHPLSA 
Rl PPPHKl RRI PPTSARHTRRKRKKEKTSAPPSEGTPPlQEEGGAGVDEEEEEEEEEEGESEAEPVEPPfP ^RDPrEGK^HlVK 
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PRKDC 

(wt)ORF (SEQ ID NO: 66) 

MAGSGAGVRCSLLRLQETLSAADRCGAALAGHQLIRGLGQECVLSSSPAVLALQTSLVFSRDFGLLVFVRKSLNSIEFRECREEJLKFL 
CIFLEKMGQKIAPYSVEIKNTCTSNATKDRAAKCKIPALDLLIKLLQTFRSSRLMDEFKIGELFSKFYGELALKKKIPDTVLEKWELLGLL 
GEVHPSEM1NNAENLFRAFLGELKTQMTSAVREPKLPVLA 

GLRLFALHASQFSTCLLDNYVSLFEVLLKWCAHTNVELKKAALSALESFLKQVSNMVAKNAEMHKNKLQYFMEQFYGIIRNVDSNNKE 

LSfAIRGYGLFAGPCKVINAKDVDFMWELiQRCKQMFLTQTDTGDDRWQMPSFLQSVASVLLYLDTVPENATPVLEHLWMQIDSFP 

QYSPKMQLVCCRAIVKVFU\LAAKGPVLRNCISTWHQGL1RICSKPWLPKGPESESEDHRASGEVRTGKWKVPTYKDW 

SSDQMMDSILADEAFFSVNSSSESLNHLLYDEFVKSVLKIVEKLDLTLEIQTVGEQENGDEAPGNAA^MIPTSDPAANLHPAKPKDFSAFI 

NLVEFCREILPEKQAEFFEPWVYSFSYELILQSTRLPLISGFYKLLSITVRNAKKIKYFEGS 

(-1)ORF (SEQ ID NO: 67) 

MAGSGAGVRCSLLRLQETLSAADRCGAALAGHQLIRGLGQECVLSSSPAVLALQTSLVFSRDFGLLVFVRKSLNSIEFRECREEILKFL 

CIFLEKMGQKIAPYSVEIKNTCTSNATKDRAAKCKIPALDLLIKLLQTFRSSRLMDEFKIGELFSKFYGELALKKKY^ 

(+1)/(-2)ORF (SEQ ID NO: 68 

MAGSGAGVRCSLLRLQETLSAADRCGAALAGHQLIRGLGQECVLSSSPAVLALQTSLVFSRDFGLLVFVRKSLNSIEFRECREEILKFL 
CIFLEKMGQKIAPYSVEIKNTCTSVYTKDRAAKCKIPAL^ 

UVRAG 

(wt)ORF) (SEQ ID NO: 69) 

MSASASVGGPVPQPPPGPAAALPPGSAARALHVELPSQQRRLRHLRNIAARNIVNRNGHQLLDTYFTLHLCSTEKIYKEFYRSEVIKN 

SLNPTWRSLDFGIMPDRLDTSVSCFWKIWGGKENIYQLLIEWKVCLDGLKYLGQQIHARNQNEIIFGLNDGYYGAPFEHKGYSNAQK 

TILLQVDQNCVRNSYDVFSLLRLHRAQCAIKQTQVTVQKIGKEIEEKLRLTSTSNELKKKSECLQLKILVLQNELERQKKALGREVALLH 

KQQIALQDKGSAFSAEHLKLQLQKESLNELRKECTAKRELFLKTNAQLTIRCRQLLSELSYIYPIDLNEHKDYFVCGVKLPNSEDFQAK 

DDGSIAVALGYTAHLVSMiSFFLQVPLRYPIIHKGSRSTIKDNINDKLTEKEREFPLYPKGGEKLQFDYGWLLNKNIAQLRYQHGLGTP 

DLRQTLPNLKNFMEHGLMVRCDRHHTSSAIPVPKRQSSIFGGADVGFSGGIPSPDKGHRKRASSENERLQYKTPPPSYNSALAQPW 

TVPSMGETERKITSLSSSLDTSLDFSKENKKKGEDLVGSLNGGHANIVHPSQEQGEALSGHRATVNGTLLPSEQAGSASVQLPGEFH 

PVSEAELCCTVEQAEEIIGLEAQVSPQVIS 

(-1)ORF (SEQ ID NO: 70) 

MSASASVGGPVPQPPPGPAAALPPGSAARALHVELPSQQRRLRHLRNIAARNIVNRNGHQLLDTYFTLHLCSTEKIYKEFYRSEVIKN 
SLNPTWRSLDFGIMPDRLDTSVSCFWKIWGGKENIYQLLIEWKVCLDGLKYLGQQIHARNQNEIIFGLNDGYYGAPFEHKGYSNAQK 
TILLQVDQNCVRNSYDVFSLLRLHRAQCAIKQTQVWQKIGK^^ 
(+1 )/(-2)ORF (SEQ ID NO: 71 ) 

MSASASVGGPVPQPPPGPAAALPPGSAARALHVELPSQQRRLRHLRNIAARNIVNRNGHQLLDTYFTLHLCSTEKIYKEFYRSEVIKN 
SLNPTWRSLDFGIMPDRLDTSVSCFWKIWGGKENIYQLLiEWKVCLDGLKYLGQQIHARNQNEllFGLNDGYYGAPFEHKGYSNAQK 
TILLQVDQNCVRNSYDVFSLLRLHRAQCAIKQTQVTVQKIGKEIEEKLRLTSTSNELKKK/Kl 

MSH3 

(wt)ORF (SEQ ID NO: 72) 

MSRRKPASGGLAASSSAPAR(^VLSRFFQSTGSLKSTSSSTGAADQVDPGAAAAAAAAAAAAPPAPPAPAFPPQLPPHVATE 

KKRPLENDGPVKKKVKKVQQKEGGSDLGMSGNSEPKKCLRTRNVSKSLEKLKEFCCDSALPQSRVQTESLQERFAVLPKCTDFDDI 

SLLHAKNAVSSEDSKRQINQKDTTLFDLSQFGSSNTSHENLQKTASKSANKRSKSIYTPLELQYIEMKQQHKDAVLCVECGYKYRFFG 

EDAEIAARELNIYCHLDHNFMTASIPTHRLFVHVRRLVAKGYKVGWKQTETAALKAIGDNRSSLFSRKLTALYTKSTLIGEDVNPLIKLD 

DAVNVDEIMTDTSTSYLLCISENKENVRDKKKGNIFIGIVGVQPATGEWFDSFQDSASRSELETRMSSLQPVELLLPSALSEQTEALIH 

RATSVSVQDDRIRVERMDNIYFEYSHAFmWEFYAKDTVDiKGSQIISGIVNLEKPVICSLAAIIKYLKEFNLE 

EFMTINGTTLRNLEILQNQTDMKTKGSLLWVLDHTKTSFGRRKLKKWVTQPLLKLREINARLDAVSEVLHSESSVFGQIENHLRKLPDI 

GRGLCSIYHKKCSTQEFFLIVKTLYHLKSEFQAllPAVNSHIQSDLLRTVILEIPELLSPVEHYLKILNECW\KVGDKTELFKD 

RKDEIQGViDEIRMHLQEIRKILKNPSAQYVTVSGQEFMIEIKNSAVSCiPTDWVKVGSTKAVSRFHSPFIVENYRHLNQLREQLVLDCS 

AEWLDFLEKFSEHYHSLCKAVHHLAWDCIFSl^KVAKQGDYCRPTVQEERKIVIKNGRHPVIDVLLGEQDQYVPNNTDLSEDSERVM 

IITGPNMGGKSSYIKQVALITIMAQIGSWPAEEATIGIVDGIFTRMGAADNIYKGRSTFMEELTDTAEIIRKATSQSLVILDELGRGTSTH 

DGIAIAYATLEYFIRDVKSLTLFVTHYPPVCELEKNYSHQVGNYHMGFLVSEDESKLDPGTAEQVPDFVTFLYQITRGIAARSYGLNVA 

KLADVPGEILKKAAHKSKELEGLINTKRKRLKYFAKLWTMHNAQDLQKWTEEFNMEETQTSLLH 

(-1)ORF(SEQIDNO:73) 

MSRRKPASGGLAASSSAPARQAVLSRFFQSTGSLKSTSSSTGAADQVDPGAAAAAAAAAAAAPPAPPAPAFPPQLPPHVATEID 

KKRPLENDGPVKKKVKKVQQKEGGSDLGMSGNSEPKKCLRTRNVSKSLEKLKEFCCDSALPQSRVQTESLQERFAVLPKCTDFDDI 

SLLHAKNAVSSEDSKRQJNQKDTTLFDLSQFGSSNTSHENLQKTASKSANKRSKSIYTPLELQYIEMKQQHKDAVLCVECGYKYRFFG 

EDAElAARELNIYCHLDHNFMTASIPTHRLFVHVRRLVAKGYKVGWKQTETAALKAiGDNRSSLFSRKLTALYTKSTLIGEDVNPLIKLD 

DAVMyDE\MTDJSTSYLLC\SBNKEHyRDK KRATFLLALWBCSLPQARLCLIVSRTLLLVQS 

(+1)/(-2)ORF (SEQ ID NO: 74) 

MSRRKPASGGb^ASSSAPAR^VLSRFFQSTGSLKSTSSSTGAADQV 

KKRPLENDGPVKKKVKKVQQKEGGSDLGMSGNSEPKKCLRTRNVSKSLEKLKEFCCDSALPQSRVQTESLQERFAVLPKCTDFDDI 
SLLHAKNAVSSEDSKRQINQKDTTLFDLSQFGSSNTSHENLQKTASKSANKRSKSIYTPLELQYIEMKQQHKDAVLCVECGYKYRFFG 
x EDAEIAARELNIYCHLDHNFMTASIPTHRLFVHVRRLVAKGYKVGWKQTETAALKAIGDNRSSLFSRKLTALYTKSTLIGEDVNPLIKLD 
DAVNVDElMTDTSTSYLLClSENKENVRPKK(K >GQHFYWHCGSAACH/?f?GC\/ 



ACVR2, 

(wt)ORF (SEQ ID NO: 107) 

MGAAAKLAFA VFLISCSSGA ILGRSETQEC LFFNANWEKD RTNQTGVEPC YGDKDKRRHC FATWKNISGS IEIVKQGCWL 
DDINCYDRTD CVEKKDSPEV YFCCCEGNMC NEKFSYFPEM EVTQPTSNPV TPKPPYYNIL LYSLVPLMLI AGIVICAFWV 
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YRHHKMAYPP VLVPTQDPGP PPPSPLLGLK PLQLLEVKAR GRFGCVWKAQ LLNEYVAVKI FPIQDKQSWQ NEYEVYSLPG 
MKHENILQF1 GAEKRGTSVD VDLWLITAFH EKGSLSDFLK ANWSWNELC HIAETMARGL AYLHEDIPGL KDGHKPAISH 
RDIKSKNVLL KNNLTACIAD FGLALKFEAG KSAGDTHGQV GTRRYMAPEV LEGAINFQRD AFLRIDMYAM GLVLWELASR 
CTAADGPVDE YMLPFEEEIG QHPSLEDMQE VWHKKKRPV LRDYWQKHAG MAMLCETIEE CWDHDAEARL SAGCVGERIT 
QMQRLTNIIT TEDIVTWTM VTNVDFPPKE SSL* 

A8, Pos. 451: -1 ORF (Mutrate 16.3%) (SEQ ID NO: 108) 

MGAAAKLAFA VFLISCSSGA ILGRSETQEC LFFNANWEKD RTNQTGVEPC YGDKDKRRHC FATWKNISGS IEIVKQGCWL 
DDINCYDRTD CVE KKTALKY IFVAVRAICV MKSFLIFRRW KSHSPLQIQL HLSHPITTSC SIPWCHLC 

A8 Pos. 1476: -1 (Mutrate 81.6%) (SEQ ID NO: 109) 

MGAAAKLAFA VFLISCSSGA ILGRSETQEC LFFNANWEKD RTNQTGVEPC YGDKDKRRHC FATWKNISGS IEIVKQGCWL 
DDINCYDRTD CVEKKDSPEV YFCCCEGNMC NEKFSYFPEM EVTQPTSNPV TPKPPYYNIL LYSLVPLMLI AGJVICAFWV 
YRHHKMAYPP VLVPTQDPGP PPPSPLLGLK PLQLLEVKAR GRFGCVWKAQ LLNEYVAVKI FPIQDKQSWQ NEYEVYSLPG 
MKHENILQFI GAEKRGTSVD VDLWLITAFH EKGSLSDFLK ANWSWNELC HIAETMARGL AYLHEDIPGL KDGHKPAISH 
RDIKSKNVLL KNNLTACIAD FGLALKFEAG KSAGDTHGQV GTRRYMAPEV LEGAINFQRD AFLRIDMYAM GLVLWELASR 
CTAADGPVDE YMLPFEEEIG QHPSLEDMQE VWHKKRGLF * 

FLJ1 1 053, A11 Pos. 1695, Mutrate 52.2% 
wtORF (SEQ ID NO: 110) 

MVLRKLSKKD VTTKLKAMQE FGTMCTERDT ETVKGVLPYW PRIFCKISLD HDRRVREATQ QAFEKLTLKV KKQLAPYLKS 
LMGYWLMAQC DTYTPAAFAA KDAFEAAFPP SKQPEAIAFC KDEITSVLQD HLIKETPDTL SDPQTVPEEE REAKFYRWT 
CSLLALKRLL CLLPDNELDS LEEKFKSLLS QNKFWKYGKH SVPQIRSAYF ELVSALCQRI PQLMKEEASK VSPSVLLSID 
DSDPIVCPAL WEAVLYTLTT IEDCWLHVNA KKSVFPKLST VIREGGRGLA TVIYPYLLPF ISKLPHSITN PKLDFFKNFL 
TSLVAGLSTE RTKTSSSESS AVISAFYECL RFIMQQNLGE EEIEQMLVND QLIPFIDAVL KDPGLQHGQL FNHLAETLSS 
WEAKADTEKD EKTAHNLENV LIHFWERLSE ICVAKISEPE ADVESVLGVS NLLQVLQKPK SSLKSSKKKN GKVRFADEIL 
ESNKENEKCV SSEGEKIEGW ELTTEPSLTH NSSGLLSPLR KKPLEDLVCK LADISINYVN ERKSEQHLRF LSTLLDSFSS 
SRVFKMLLGD EKQSIVQAKP LEIAKLVQKN PAVQFLYQKL IGWLNEDQRK DFGFLVDJLY SALRCCDNDM 

-1 ORF (SEQ ID NO: 111) 

MVLRKLSKKD VTTKLKAMQE FGTMCTERDT ETVKGVLPYW PRIFCKISLD HDRRVREATQ QAFEKLTLKV KKQLAPYLKS 
LMGYWLMAQC DTYTPAAFAA KDAFEAAFPP SKQPEAIAFC KDEITSVLQD HLIKETPDTL SDPQTVPEEE REAKFYRWT 
CSLLALKRLL CLLPDNELDS LEEKFKSLLS QNKFWKYGKH SVPQIRSAYF ELVSALCQRI PQLMKEEASK VSPSVLLSID 
DSDPIVCPAL WEAVLYTLTT IEDCWLHVNA KKSVFPKLST VIREGGRGLA TVIYPYLLPF ISKLPHSITN PKLDFFKNFL 
TSLVAGLSTE RTKTSSSESS AVISAFYECL RFIMQQNLGE EEIEQMLVND QLIPFIDAVL KDPGLQHGQL FNHLAETLSS 
WEAKADTEKD EKTAHNLENV LIHFWERLSE ICVAKISEPE ADVESVLGVS NLLQVLQKPK SSLKSSKK KM VRLDLLMRYL 
KAIKRMKNVY LQKERRLKAG N* 

-2 ORF (SEQ ID NO: 112) 

MVLRKLSKKD VTTKLKAMQE FGTMCTERDT ETVKGVLPYW PRIFCKISLD HDRRVREATQ QAFEKLTLKV KKQLAPYLKS 

LMGYWLMAQC DTYTPAAFAA KDAFEAAFPP SKQPEAIAFC KDEITSVLQD HLIKETPDTL SDPQTVPEEE REAKFYRWT 
CSLLALKRLL CLLPDNELDS LEEKFKSLLS QNKFWKYGKH SVPQIRSAYF ELVSALCQRI PQLMKEEASK VSPSVLLSID 

DSDPIVCPAL WEAVLYTLTT IEDCWLHVNA KKSVFPKLST VIREGGRGLA TVIYPYLLPF ISKLPHSITN PKLDFFKNFL 

TSLVAGLSTE RTKTSSSESS AVISAFYECL RFIMQQNLGE EEIEQMLVND QLIPFIDAVL KDPGLQHGQL FNHLAETLSS 
WEAKADTEKD EKTAHNLENV LIHFWERLSE ICVAKISEPE ADVESVLGVS NLLQVLQKPK SSLKSSKKKVV* 

+ 1 ORF (SEQ ID NO: 113) 

MVLRKLSKKD VTTKLKAMQE FGTMCTERDT ETVKGVLPYW PRIFCKISLD HDRRVREATQ QAFEKLTLKV KKQLAPYLKS 
LMGYWLMAQC DTYTPAAFAA KDAFEAAFPP SKQPEAIAFC KDEITSVLQD HLIKETPDTL SDPQTVPEEE REAKFYRWT 
CSLLALKRLL CLLPDNELDS LEEKFKSLLS QNKFWKYGKH SVPQIRSAYF ELVSALCQRI PQLMKEEASK VSPSVLLSID 
DSDPIVCPAL WEAVLYTLTT IEDCWLHVNA KKSVFPKLST VIREGGRGLA TVIYPYLLPF ISKLPHSITN PKLDFFKNFL 
TSLVAGLSTE RTKTSSSESS AVISAFYECL RFIMQQNLGE EEIEQMLVND QLIPFIDAVL KDPGLQHGQL FNHLAETLSS 
WEAKADTEKD EKTAHNLENV LIHFWERLSE ICVAKISEPE ADVESVLGVS NLLQVLQKPK SSLKSSKKKK Wf 

KIAA1 052, A1 1 Pos. 689, Mutrate 42.2% 
Wt ORF (SEQ ID NO: 114) 

MAGRPLRIGD QLVLEEDYDE TYIPSEQEIL EFAREIGIDP IKEPELMWLA REGIVAPLPG EWKPCQDITG DIYYFNFANG 
QSMWDHPCDE HYRSLVIQER AKLSTSGAIK KKKKKKEKKD KKDRDPPKSS LALGSSLAPV HVPLGGLAPL RGLVDTPPSA 
LRGSQSVSLG SSVESGRQLG ELMLPSQGLK TSAYTKGLLG SIYEDKTALS LLGLGEETNE EDEEESDNQS VHSSSEPLRN 
LHLDIGALGG DFEYEESLRT SQPEEKKDVS LDSDAAGPPT PCKPSSPGAD SSLSSAVGKG RQGSGARPGL PEKEENEKSE 
PKICRNLVTP KADPTGSEPA KASEKEAPED TVDAGEEGSR REEAAKEPKK KASALEEGSS DASQELEISE HMKEPQLSDS 
IASDPKSFHG LDFGFRSRIS EHLLDVDVLS PVLGGACRQA QQPLGIEDKD DSQSSQDELQ SKQSKGLEER YHRLSPPLPH 
EERAQSPPRS LATEEEPPQG PEGQPEWKEA EELGEDSAAS LSLQLSLQRE QAPSPPAACE KGKEQHSQAE ELGPGQEEAE 
DPEEKVAVSP TPPVSPEVRS TEPVAPPEQL SEAALKAMEE AVAQVLEQDQ RHLLESKQEK MQQLREKLCQ EEEEEILRLH 
QQKEQSLSSL RERLQKAIEE EEARMREEES QRLSWLRAQV QSSTQADEDQ IRAEQEASLQ KLREELESQQ KAERASLEQK 
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NRQMLEQLKE EIEASEKSEQ AALNAAKEKA LQQLREQLEG ERKEAVATLE KEHSAELERL CSSLEAKHRE WSSLQKKIQ 
EAQQKEEAQL QKCLGQVEHR VHQKSYHVAG YEHELSSLLR EKRQEVEGEH ERRLDKMKEE HQQVMAKARE QYEAEERKQR 
AELLGHLTGE LERLQRAHER ELETVRQEQH KRLEDLRRRH REQERKLQDL ELDLETRAKD VKARLALLEV QEETARREKQ 
QLLDVQRQVA LKSEEATATH QQLEEAQKEH THLLQSNQQL REILDELQAR KLKLESQVDL LQAQSQQLQK HFSSLEAEAQ 
KKQHLLREVT VEENNASPHF EPDLHIEDLR KSLGTNQTKE VSSSLSQSKE DLYLDSLSSH NVWHLLSAEG VALRSAKEFL 
VQQTRSMRRR QTALKAAQQH WRHELASAQE VAKDPPGIKA LEDMRKNLEK ETRHLDEMKS AMRKGHNLLK KKEEKLNQLE 
SSLWEEASDE GTLGGSPTKK AVTFDLSDMD SLSSESSESF SPPHLDSTPS LTSRKIHGLS HSLRQiSSQL SSVLS1LDSL 
NPQSPPPLLA SMPAQLPPRD PKSTPTPTYY GSLARFSALS SATPTSTQWA WDSGQGPRLP SSVAQTVDDF LLEKWRKYFP 
SGIPLLSNSP TPLESRLGYM SASEQLRLLQ HSHSQVPEAG STTFQGIIEA NRRWLERVKN DPRLPLFSST PKPKATLSLL 
QLGLDEHNRV KVYRF* 

- 1 ORF (SEQ ID NO: 115) 

MAGRPLRIGD QLVLEEDYDE TYIPSEQEIL EFAREIGIDP IKEPELMWLA REGIVAPLPG EWKPCQDITG DIYYFNFANG 
QSMWDHPCDE HYRSLVIQER AKLSTSGAIK MKK KRKRKT RRTETPPKVR WPWVPhT 

-20RF (SEQ ID NO: 116) 

MAGRPLRIGD QLVLEEDYDE TYIPSEQEIL EFAREIGIDP IKEPELMWLA REGIVAPLPG EWKPCQDITG DIYYFNFANG 
QSMWDHPCDE HYRSLVIQER AKLSTSGAIK KKKK KGKERQ EGQRPPQKFA GLGFLISPSS CSSWGPGSFT RSCGYPTLCS 
SWISKREPGE LSGVWTSAWR THAAFT GSQD LCL YKGSL GLHP 

+ 1 ORF (SEQ ID NO: 117) 

MAGRPLRIGD QLVLEEDYDE TYIPSEQEIL EFAREIGIDP IKEPELMWLA REGIVAPLPG EWKPCQDITG DIYYFNFANG 
QSMWDHPCDE HYRSLVIQER AKLSTSGAIK KKKKK KGKE/? QEGQRPPQKF AGLGFLISPS SCSSWGPGSF TRSCGYPTLC 
SSWISKREPG ELSGVWTSA W RTHAAFTGSQ D LCLYKGSLG LHI* 



